Event-driven configuration of a neural network CMP system over an homogeneous interconnect fabric

Authors

  • Mukaram M. Khan
  • Alexander D. Rast
  • Javier Navaridas
  • X. Jin
  • Luis A. Plana
  • Mikel Luján
  • Steve Temple
  • Cameron Patterson
  • Dominic Richards
  • John V. Woods
  • José Miguel-Alonso
  • Stephen B. Furber
Abstract

Configuring a million-core parallel system at boot time is a difficult process when the system has neither specialised hardware support for the configuration process nor a preconfigured default state that puts it in operating condition. The architecture of SpiNNaker, a parallel chip multiprocessor (CMP) system for neural network simulation, is in this class. To function as a universal neural chip, SpiNNaker uses an event-driven model with complete system virtualisation so that all components are generic and identical. Where most large CMP systems feature a sideband network to complete the boot process, SpiNNaker has a single homogeneous network interconnect for both application inter-processor communications and system control functions. This network improves fault tolerance and makes it easier to support dynamic run-time reconfiguration; however, it requires a boot process compatible with the application's communications model. Here, we present such a boot loader, capable of bringing a generic, initially unconfigured parallel system into a working configuration. Since SpiNNaker uses event-driven asynchronous communications throughout, the loader operates with purely local control: there is no global synchronisation, state information, or transition sequence. A novel two-stage "unfolding" boot-up process efficiently configures the SpiNNaker hardware and loads the application using a high-speed flood-fill technique with support for run-time reconfiguration. SystemC simulation of a multi-CMP SpiNNaker system indicates an error-free CMP configuration time of 1.37 ms, while a high-level simulation of a full-scale system (64 K CMPs) indicates a mean application-loading time of 20 ms (for a 100 KB application), which is virtually independent of the size of the system. Further hardware-level Verilog simulation verified the cycle-accurate functionality of CMP configuration. The complete process illustrates a useful method for configuring large-scale event-driven parallel systems without having to provide dedicated hardware boot support or rely on system state assumptions.

© 2011 Elsevier B.V. All rights reserved.
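The flood-fill idea in the abstract can be illustrated with a small toy model. This is only a sketch under stated assumptions, not the authors' loader: the 4-neighbour torus topology, the unit time step per hop, the one-block-per-step injection rate, and the function names `flood_fill_hops` and `pipelined_load_steps` are all hypothetical. Each node applies a purely local rule (forward a block to its neighbours the first time it arrives), and the application is streamed as a pipeline of blocks, so the total step count is roughly the number of blocks plus the network depth.

```python
from collections import deque


def flood_fill_hops(width, height, origin=(0, 0)):
    """Hop count at which each node of a width x height torus first
    receives a block flooded from `origin`.

    Assumption: every node has four wrap-around links; the real
    interconnect may use a different neighbour set.
    """
    links = [(1, 0), (-1, 0), (0, 1), (0, -1)]
    hops = {origin: 0}
    queue = deque([origin])
    while queue:
        x, y = queue.popleft()
        for dx, dy in links:
            nxt = ((x + dx) % width, (y + dy) % height)
            # Local rule: a node accepts and forwards a block only on
            # first receipt, so no global state or ordering is needed.
            if nxt not in hops:
                hops[nxt] = hops[(x, y)] + 1
                queue.append(nxt)
    return hops


def pipelined_load_steps(width, height, n_blocks):
    """Step at which the farthest node receives the last block, assuming
    one block is injected per step (steps 0 .. n_blocks - 1) and each
    hop costs one step."""
    depth = max(flood_fill_hops(width, height).values())
    return (n_blocks - 1) + depth


if __name__ == "__main__":
    # Hypothetical block count: a 100 KB image split into 4-byte blocks.
    n_blocks = 100 * 1024 // 4
    small = pipelined_load_steps(4, 4, n_blocks)      # 16 nodes
    large = pipelined_load_steps(256, 256, n_blocks)  # 65,536 nodes
    print(small, large, large / small)
```

In this toy model the 65,536-node grid needs only about 1% more steps than the 16-node grid, because the block count dominates the network depth. That is one plausible reading of why the reported loading time is nearly independent of system size; the actual timing figures come from the authors' simulations, not from this sketch.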




Journal:
  • Parallel Computing

Volume 37

Publication date: 2011